CLAREMONT McKENNA COLLEGE
SCALABLE COLLABORATIVE FILTERING RECOMMENDATION ALGORITHMS ON APACHE SPARK
SUBMITTED TO PROFESSOR DEANNA NEEDELL AND DEAN
Abstract
Collaborative filtering based recommender systems use information about a user's preferences to make personalized predictions about content, such as topics, people, or products, that the user might find relevant. As the volume of accessible information and the number of active users on the Internet continue to grow, it becomes increasingly difficult to compute recommendations quickly and accurately over a large dataset. In this study, we introduce an algorithmic framework built on top of Apache Spark for parallel computation of the neighborhood-based collaborative filtering problem, which allows the algorithm to scale linearly with a growing number of users. We also investigate several variants of this technique, including user-based and item-based recommendation approaches, correlation-based and vector-based similarity calculations, and selective down-sampling of user interactions. Finally, we provide an experimental comparison of these techniques on the MovieLens dataset consisting of 10 million movie ratings.
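To make the terms in the abstract concrete, the following is a minimal single-machine sketch of user-based neighborhood collaborative filtering with a vector-based (cosine) and a correlation-based (Pearson) similarity. It is not the Spark framework described in the study; the toy `ratings` data and the function names `cosine`, `pearson`, and `predict` are assumptions made purely for illustration.

```python
from math import sqrt

# Hypothetical toy data (not from MovieLens): user -> {item: rating}
ratings = {
    "u1": {"a": 5.0, "b": 3.0, "c": 4.0},
    "u2": {"a": 4.0, "b": 2.0, "c": 5.0, "d": 4.0},
    "u3": {"a": 1.0, "b": 5.0, "d": 2.0},
}


def cosine(x, y):
    """Vector-based similarity: cosine of the angle between rating vectors."""
    common = set(x) & set(y)
    if not common:
        return 0.0
    dot = sum(x[i] * y[i] for i in common)
    norm_x = sqrt(sum(v * v for v in x.values()))
    norm_y = sqrt(sum(v * v for v in y.values()))
    return dot / (norm_x * norm_y) if norm_x and norm_y else 0.0


def pearson(x, y):
    """Correlation-based similarity over the items both users rated."""
    common = sorted(set(x) & set(y))
    if len(common) < 2:
        return 0.0
    mean_x = sum(x[i] for i in common) / len(common)
    mean_y = sum(y[i] for i in common) / len(common)
    num = sum((x[i] - mean_x) * (y[i] - mean_y) for i in common)
    den = sqrt(sum((x[i] - mean_x) ** 2 for i in common)) * sqrt(
        sum((y[i] - mean_y) ** 2 for i in common)
    )
    return num / den if den else 0.0


def predict(user, item, sim, k=2):
    """Similarity-weighted average of the k most similar users' ratings."""
    scored = [
        (sim(ratings[user], other), other[item])
        for name, other in ratings.items()
        if name != user and item in other
    ]
    # Keep the k nearest neighbors, ignoring non-positive similarities.
    top = sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]
    top = [(s, r) for s, r in top if s > 0]
    total = sum(s for s, _ in top)
    if total == 0:
        return None
    return sum(s * r for s, r in top) / total
```

Calling `predict("u1", "d", cosine)` or `predict("u1", "d", pearson)` estimates how user `u1` would rate the unseen item `d`. An item-based variant would apply the same similarity functions to per-item rating vectors instead of per-user ones.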